300 research outputs found
An Intelligent System For Arabic Text Categorization
Text Categorization (classification) is the process of classifying documents into a predefined set of categories based on their content. In this paper, an intelligent Arabic text categorization system is presented. Machine learning algorithms are used in this system. Many algorithms for stemming and feature selection are tried. Moreover, the document is represented using several term weighting schemes and finally the k-nearest neighbor and Rocchio classifiers are used for classification process. Experiments are performed over self collected data corpus and the results show that the suggested hybrid method of statistical and light stemmers is the most suitable stemming algorithm for Arabic language. The results also show that a hybrid approach of document frequency and information gain is the preferable feature selection criterion and normalized-tfidf is the best weighting scheme. Finally, Rocchio classifier has the advantage over k-nearest neighbor classifier in the classification process. The experimental results illustrate that the proposed model is an efficient method and gives generalization accuracy of about 98%
A generic open world named entity disambiguation approach for tweets
Social media is a rich source of information. To make use of this information, it is sometimes required to extract and disambiguate named entities. In this paper we focus on named entity disambiguation (NED) in twitter messages. NED in tweets is challenging in two ways. First, the limited length of Tweet makes it hard to have enough context while many disambiguation techniques depend on it. The second is that many named entities in tweets do not exist in a knowledge base (KB). In this paper we share ideas from information retrieval (IR) and NED to propose solutions for both challenges. For the first problem we make use of the gregarious nature of tweets to get enough context needed for disambiguation. For the second problem we look for an alternative home page if there is no Wikipedia page represents the entity. Given a mention, we obtain a list of Wikipedia candidates from YAGO KB in addition to top ranked pages from Google search engine. We use Support Vector Machine (SVM) to rank the candidate pages to find the best representative entities. Experiments conducted on two data sets show better disambiguation results compared with the baselines and a competitor
The self-consistent bounce: an improved nucleation rate
We generalize the standard computation of homogeneous nucleation theory at
zero temperature to a scenario in which the bubble shape is determined
self-consistently with its quantum fluctuations. Studying two scalar models in
1+1 dimensions, we find the self-consistent bounce by employing a two-particle
irreducible (2PI) effective action in imaginary time at the level of the
Hartree approximation. We thus obtain an effective single bounce action which
determines the rate exponent. We use collective coordinates to account for the
translational invariance and the growth instability of the bubble and finally
present a new nucleation rate prefactor. We compare the results with those
obtained using the standard 1-loop approximation and show that the
self-consistent rate can differ by several orders of magnitude.Comment: 28 pages, revtex, 7 eps figure
WKB approximation for inflationary cosmological perturbations
A new method for predicting inflationary cosmological perturbations, based on
the Wentzel-Kramers-Brillouin (WKB) approximation, is presented. A general
expression for the WKB scalar and tensor power spectra is derived. The main
advantage of the new scheme of approximation is that it is valid even if the
slow-roll conditions are violated. The method is applied to power-law
inflation, which allows a comparison with an exact result. It is demonstrated
that the WKB approximation predicts the spectral indices exactly and the
amplitude with an error lower than 10%, even in regimes far from
scale-invariance. The new method of approximation is also applied to a
situation where the slow-roll conditions hold. It is shown that the result
obtained bears close resemblance with the standard slow-roll calculation.
Finally, some possible improvements are briefly mentioned.Comment: 11 pages, 1 figure, RevTeX; minor changes, reference added (v2);
typos corrected (v3
Monte Carlo reconstruction of the inflationary potential
We present Monte Carlo reconstruction, a new method for ``inverting''
observational data to constrain the form of the scalar field potential
responsible for inflation. This stochastic technique is based on the flow
equation formalism and has distinct advantages over reconstruction methods
based on a Taylor expansion of the potential. The primary ansatz required for
Monte Carlo reconstruction is simply that inflation is driven by a single
scalar field. We also require a very mild slow roll constraint, which can be
made arbitrarily weak since Monte Carlo reconstruction is implemented at
arbitrary order in the slow roll expansion. While our method cannot evade
fundamental limits on the accuracy of reconstruction, it can be simply and
consistently applied to poor data sets, and it takes advantage of the attractor
properties of single-field inflation models to constrain the potential outside
the small region directly probed by observations. We show examples of Monte
Carlo reconstruction for data sets similar to that expected from the Planck
satellite, and for a hypothetical measurement with a factor of five better
parameter discrimination than Planck.Comment: 10 pages, 5 figures (RevTeX 4) Version submitted to PRD: references
added, minor clarification
The delta-function-kicked rotor: Momentum diffusion and the quantum-classical boundary
We investigate the quantum-classical transition in the delta-kicked rotor and
the attainment of the classical limit in terms of measurement-induced
state-localization. It is possible to study the transition by fixing the
environmentally induced disturbance at a sufficiently small value, and
examining the dynamics as the system is made more macroscopic. When the system
action is relatively small, the dynamics is quantum mechanical and when the
system action is sufficiently large there is a transition to classical
behavior. The dynamics of the rotor in the region of transition, characterized
by the late-time momentum diffusion coefficient, can be strikingly different
from both the purely quantum and classical results. Remarkably, the early time
diffusive behavior of the quantum system, even when different from its
classical counterpart, is stabilized by the continuous measurement process.
This shows that such measurements can succeed in extracting essentially quantum
effects. The transition regime studied in this paper is accessible in ongoing
experiments.Comment: 8 pages, 4 figures, revtex4 (revised version contains much more
introductory material
Additive and Multiplicative Noise Driven Systems in 1+1 Dimensions: Waiting Time Extraction of Nucleation Rates
We study the rate of true vacuum bubble nucleation numerically for a phi^4
field system coupled to a source of thermal noise. We compare in detail the
cases of additive and multiplicative noise. We pay special attention to the
choice of initial field configuration, showing the advantages of a version of
the quenching technique. We advocate a new method of extracting the nucleation
time scale that employs the full distribution of nucleation times. Large data
samples are needed to study the initial state configuration choice and to
extract nucleation times to good precision. The 1+1 dimensional models afford
large statistics samples in reasonable running times. We find that for both
additive and multiplicative models, nucleation time distributions are well fit
by a waiting time, or gamma, distribution for all parameters studied. The
nucleation rates are a factor three or more slower for the multiplicative
compared to the additive models with the same dimensionless parameter choices.
Both cases lead to high confidence level linear fits of ln(nucleation time) vs.
1/T plots, in agreement with semiclassical nucleation rate predictions.Comment: 38 pages, 20 figures, 6 table
Search for a W' boson decaying to a bottom quark and a top quark in pp collisions at sqrt(s) = 7 TeV
Results are presented from a search for a W' boson using a dataset
corresponding to 5.0 inverse femtobarns of integrated luminosity collected
during 2011 by the CMS experiment at the LHC in pp collisions at sqrt(s)=7 TeV.
The W' boson is modeled as a heavy W boson, but different scenarios for the
couplings to fermions are considered, involving both left-handed and
right-handed chiral projections of the fermions, as well as an arbitrary
mixture of the two. The search is performed in the decay channel W' to t b,
leading to a final state signature with a single lepton (e, mu), missing
transverse energy, and jets, at least one of which is tagged as a b-jet. A W'
boson that couples to fermions with the same coupling constant as the W, but to
the right-handed rather than left-handed chiral projections, is excluded for
masses below 1.85 TeV at the 95% confidence level. For the first time using LHC
data, constraints on the W' gauge coupling for a set of left- and right-handed
coupling combinations have been placed. These results represent a significant
improvement over previously published limits.Comment: Submitted to Physics Letters B. Replaced with version publishe
Search for new physics in events with opposite-sign leptons, jets, and missing transverse energy in pp collisions at sqrt(s) = 7 TeV
A search is presented for physics beyond the standard model (BSM) in final
states with a pair of opposite-sign isolated leptons accompanied by jets and
missing transverse energy. The search uses LHC data recorded at a
center-of-mass energy sqrt(s) = 7 TeV with the CMS detector, corresponding to
an integrated luminosity of approximately 5 inverse femtobarns. Two
complementary search strategies are employed. The first probes models with a
specific dilepton production mechanism that leads to a characteristic kinematic
edge in the dilepton mass distribution. The second strategy probes models of
dilepton production with heavy, colored objects that decay to final states
including invisible particles, leading to very large hadronic activity and
missing transverse energy. No evidence for an event yield in excess of the
standard model expectations is found. Upper limits on the BSM contributions to
the signal regions are deduced from the results, which are used to exclude a
region of the parameter space of the constrained minimal supersymmetric
extension of the standard model. Additional information related to detector
efficiencies and response is provided to allow testing specific models of BSM
physics not considered in this paper.Comment: Replaced with published version. Added journal reference and DO
Measurement of the Lambda(b) cross section and the anti-Lambda(b) to Lambda(b) ratio with Lambda(b) to J/Psi Lambda decays in pp collisions at sqrt(s) = 7 TeV
The Lambda(b) differential production cross section and the cross section
ratio anti-Lambda(b)/Lambda(b) are measured as functions of transverse momentum
pt(Lambda(b)) and rapidity abs(y(Lambda(b))) in pp collisions at sqrt(s) = 7
TeV using data collected by the CMS experiment at the LHC. The measurements are
based on Lambda(b) decays reconstructed in the exclusive final state J/Psi
Lambda, with the subsequent decays J/Psi to an opposite-sign muon pair and
Lambda to proton pion, using a data sample corresponding to an integrated
luminosity of 1.9 inverse femtobarns. The product of the cross section times
the branching ratio for Lambda(b) to J/Psi Lambda versus pt(Lambda(b)) falls
faster than that of b mesons. The measured value of the cross section times the
branching ratio for pt(Lambda(b)) > 10 GeV and abs(y(Lambda(b))) < 2.0 is 1.06
+/- 0.06 +/- 0.12 nb, and the integrated cross section ratio for
anti-Lambda(b)/Lambda(b) is 1.02 +/- 0.07 +/- 0.09, where the uncertainties are
statistical and systematic, respectively.Comment: Submitted to Physics Letters
- âŚ